Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Goal-Oriented Rectification of Camera-Based Document Images

Identifieur interne : 000562 ( Main/Exploration ); précédent : 000561; suivant : 000563

Goal-Oriented Rectification of Camera-Based Document Images

Auteurs : Nikolaos Stamatopoulos [Grèce] ; Basilis Gatos [Grèce] ; Ioannis Pratikakis [Grèce] ; Stavros J. Perantonis [Grèce]

Source :

RBID : Pascal:11-0188625

Descripteurs français

English descriptors

Abstract

Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Goal-Oriented Rectification of Camera-Based Document Images</title>
<author>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>National and Kapodistrian University of Athens</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gatos, Basilis" sort="Gatos, Basilis" uniqKey="Gatos B" first="Basilis" last="Gatos">Basilis Gatos</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Department of Electrical and Computer Engineering, Democritus University of Thrace</s1>
<s2>67100 Xanthi</s2>
<s3>GRC</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>67100 Xanthi</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J." last="Perantonis">Stavros J. Perantonis</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">11-0188625</idno>
<date when="2011">2011</date>
<idno type="stanalyst">PASCAL 11-0188625 INIST</idno>
<idno type="RBID">Pascal:11-0188625</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000148</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000625</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000116</idno>
<idno type="wicri:doubleKey">1057-7149:2011:Stamatopoulos N:goal:oriented:rectification</idno>
<idno type="wicri:Area/Main/Merge">000568</idno>
<idno type="wicri:Area/Main/Curation">000562</idno>
<idno type="wicri:Area/Main/Exploration">000562</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Goal-Oriented Rectification of Camera-Based Document Images</title>
<author>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>National and Kapodistrian University of Athens</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gatos, Basilis" sort="Gatos, Basilis" uniqKey="Gatos B" first="Basilis" last="Gatos">Basilis Gatos</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Department of Electrical and Computer Engineering, Democritus University of Thrace</s1>
<s2>67100 Xanthi</s2>
<s3>GRC</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>67100 Xanthi</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J." last="Perantonis">Stavros J. Perantonis</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos</s1>
<s2>Athens GR-15310</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens GR-15310</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">IEEE transactions on image processing</title>
<title level="j" type="abbreviated">IEEE trans. image process.</title>
<idno type="ISSN">1057-7149</idno>
<imprint>
<date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">IEEE transactions on image processing</title>
<title level="j" type="abbreviated">IEEE trans. image process.</title>
<idno type="ISSN">1057-7149</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Accuracy</term>
<term>Cost lowering</term>
<term>Curved surface</term>
<term>Degradation</term>
<term>Digitizing</term>
<term>Document image processing</term>
<term>Image analysis</term>
<term>Image quality</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Robustness</term>
<term>Semiautomatic method</term>
<term>Signal distortion</term>
<term>Warping</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Traitement image document</term>
<term>Numérisation</term>
<term>Gauchissement</term>
<term>Dégradation</term>
<term>Evaluation performance</term>
<term>Reconnaissance optique caractère</term>
<term>Distorsion signal</term>
<term>Qualité image</term>
<term>Diminution coût</term>
<term>Surface courbe</term>
<term>Robustesse</term>
<term>Précision</term>
<term>Méthode semi-automatique</term>
<term>Analyse image</term>
<term>Reconnaissance forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Numérisation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Grèce</li>
</country>
</list>
<tree>
<country name="Grèce">
<noRegion>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
</noRegion>
<name sortKey="Gatos, Basilis" sort="Gatos, Basilis" uniqKey="Gatos B" first="Basilis" last="Gatos">Basilis Gatos</name>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J." last="Perantonis">Stavros J. Perantonis</name>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
<name sortKey="Stamatopoulos, Nikolaos" sort="Stamatopoulos, Nikolaos" uniqKey="Stamatopoulos N" first="Nikolaos" last="Stamatopoulos">Nikolaos Stamatopoulos</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000562 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000562 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:11-0188625
   |texte=   Goal-Oriented Rectification of Camera-Based Document Images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024